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Abstract 

Discourse structure is the hidden link be¬ 
tween surface features and document-level 
properties, such as sentiment polarity. We 
show that the discourse analyses produced 
by Rhetorical Structure Theory (RST) 
parsers can improve document-level senti¬ 
ment analysis, via composition of local in¬ 
formation up the discourse tree. First, we 
show that reweighting discourse units ac¬ 
cording to their position in a dependency 
representation of the rhetorical structure 
can yield substantial improvements on 
lexicon-based sentiment analysis. Next, 
we present a recursive neural network 
over the RST structure, which offers sig¬ 
nificant improvements over classification- 
based methods. 

1 Introduction 

Sentiment analysis and opinion mining are among 
the most widely-used applications of language 
technology, impacting both industry and a vari¬ 
ety of other academic disciplines (Feldman, 2013; 
Liu, 2012; Pang and Lee, 2008). Yet senti¬ 
ment analysis is still dominated by bag-of-words 
approaches, and attempts to include additional 
linguistic context typically stop at the sentence 
level (Socher et al., 2013). Since document-level 
opinion mining inherently involves multi-sentence 
texts, it seems that analysis of document-level 
structure should have a role to play. 

A classic example of the potential relevance of 
discourse to sentiment analysis is shown in Fig¬ 
ure 1. In this review of the film The Last Samu¬ 
rai, the positive sentiment words far outnumber 
the negative sentiment words. But the discourse 
Structure — indicated here with Rhetorical Struc¬ 
ture Theory (RST; Mann and Thompson, 1988) — 

*Code is available at https://github.com/ 
parry2403/R2N2 



[It could have been a great movie]^'^ [It does have 
beautiful scenery,]^® [some of the best since Lord of 
the Rings.]^*^ [The acting is weii done,]^° [and i reaiiy 
iiked the son of the leader of the Samurai.]^® [He was 
a likable chap,]^^ [and I hated to see him die.]^*^ [But, 
other than all that, this movie is nothing more than hid- 
den rip-offs.]^^ 

Figure 1: Example adapted from Voll and Taboada 
(2007). 

clearly favors the final sentence, whose polarity 
is negative. This example is illustrative in more 
than one way: it was originally identified by Voll 
and Taboada (2007), who found that manually- 
annotated RST parse trees improved lexicon- 
based sentiment analysis, but that automatically- 
generated parses from the SPADE parser (Soricut 
and Marcu, 2003), which was then state-of-the-art, 
did not. 

Since this time, RST discourse parsing has im¬ 
proved considerably, with the best systems now 
yielding 5-10% greater raw accuracy than SPADE, 
depending on the metric. The time is therefore 
right to reconsider the effectiveness of RST for 
document-level sentiment analysis. In this pa¬ 
per, we present two different ways of combin¬ 
ing RST discourse parses with sentiment analy¬ 
sis. The methods are both relatively simple, and 

















can be used in combination with an “off the shelf” 
discourse parser. We consider the following two 
architectures: 

• Reweighting the contribution of each dis¬ 
course unit, based on its position in a 
dependency-like representation of the dis¬ 
course structure. Such weights can be de¬ 
fined using a simple function, or learned from 
a small of data. 

• Recursively propagating sentiment up 
through the RST parse, in an architecture in¬ 
spired by recursive neural networks (Smolen¬ 
sky, 1990; Socher et al., 2011). 

Both architectures can be used in combination 
with either a lexicon-based sentiment analyzer, or 
a trained classifier. Indeed, for users whose sfarf- 
ing poinf is a lexicon-based approach, a simple 
RST-based reweighfing funclion can offer signif- 
icanf improvemenfs. For fhose who are willing 
fo frain a senfimenf classifier, fhe recursive model 
yields furlher gains. 

2 Background 

2.1 Rhetorical Structure Theory 

RST is a compositional model of discourse struc¬ 
ture, in which elementary discourse units (EDUs) 
are combined intro progressively larger discourse 
units, ultimately covering the entire document. 
Discourse relations may involve a nucleus and a 
satellite, or they may be multinuclear. In the ex¬ 
ample in Figure 1, the unit IC is the satellite of 
a relationship with its nucleus li?; together they 
form a larger discourse unit, which is involved in 
a multinuclear CONJUNCTION relation. 

The nuclearity structure of RST trees suggests 
a natural approach to evaluating the importance 
of segments of text: satellites tend to be less 
important, and nucleii tend to be more impor¬ 
tant (Marcu, 1999). This idea has been leveraged 
extensively in document summarization (Gerani 
et al., 2014; Uzeda et al., 2010; Yoshida et 
al., 2014), and was the inspiration for Voll and 
Taboada (2007), who examined intra-sentential re¬ 
lations, eliminating all words except those in the 
top-most nucleus within each sentence. More re¬ 
cent work focuses on reweighting each discourse 
unit depending on the relations in which it partic¬ 
ipates (Heerschop et al., 2011; Hogenboom et al.. 


2015). We consider such an approach, and com¬ 
pare it with a compositional method, in which sen¬ 
timent polarity is propagated up the discourse tree. 

Marcu (1997) provides the seminal work on 
automatic RST parsing, but there has been a re¬ 
cent spike of interest in this task, with contempo¬ 
rary approaches employing discriminative learn¬ 
ing (Hernault et al., 2010), rich features (Feng 
and Hirst, 2012), structured prediction (Joty et al., 
2015), and representation learning (Ji and Eisen- 
stein, 2014; Fi et al., 2014). With many strong 
systems to choose from, we employ the publicly- 
available DPFP parser (Ji and Eisenstein, 2014),^ 
To our knowledge, this system currently gives the 
best F-measure on relation identification, the most 
difficult subtask of RST parsing. DPFP is a shift- 
reduce parser (Sagae, 2009), and its time complex¬ 
ity is linear in the length of the document. 

2.2 Sentiment analysis 

There is a huge literature on sentiment analy¬ 
sis (Pang and Fee, 2008; Fiu, 2012), with partic¬ 
ular interest in determining the overall sentiment 
polarity (positive or negative) of a document. Bag- 
of-words models are widely used for this task, as 
they offer accuracy that is often very competitive 
with more complex approaches. Given labeled 
data, supervised learning can be applied to obtain 
sentiment weights for each word. However, the 
effectiveness of supervised sentiment analysis de¬ 
pends on having training data in the same domain 
as the target, and this is not always possible. More¬ 
over, in social science applications, the desired 
labels may not correspond directly to positive or 
negative sentiment, but may focus on other cat¬ 
egories, such as politeness (Danescu-Niculescu- 
Mizil et al., 2013), narrative frames (Jurafsky et 
al., 2014), or a multidimensional spectrum of emo¬ 
tions (Kim et al., 2012). In these cases, labeled 
documents may not be available, so users of¬ 
ten employ a simpler method: counting matches 
against lists of words associated with each cate¬ 
gory. Such lists may be built manually from intro¬ 
spection, as in FIWC (Tausczik and Pennebaker, 
2010) and the General Inquirer (Stone, 1966). Al¬ 
ternatively, they may be induced by bootstrapping 
from a seed set of words (Hatzivassiloglou and 
McKeown, 1997; Taboada et al., 2011). While 
lexicon-based methods may be less accurate than 
supervised classifiers, they are easier to apply to 

’https://github.com/jiyfeng/DPLP 




IB ID IE 
i / \ 

1C IF IG 

Figure 2: Dependency-based discourse tree repre¬ 
sentation of the discourse in Figure 1 

new domains and problem settings. Our proposed 
approach can be used in combination with either 
method for sentiment analysis, and in principle, 
could be directly applied to other document-level 
categories, such as politeness. 

2.3 Datasets 

We evaluate on two review datasets. In both cases, 
the goal is to correctly classify the opinion po¬ 
larity as positive or negative. The first dataset 
is comprised of 2000 movie reviews, gathered by 
Pang and Lee (2004). We perform ten-fold cross- 
validation on this data. The second dataset is 
larger, consisting of 50,000 movie reviews, gath¬ 
ered by Socher et al. (2013), with a predefined 
50/50 splif info fraining and fesf sefs. Documenfs 
are scored on a 1-10 scale, and we freaf scores 
< 4 as negafive, > 7 as positive, and ignore scores 
of 5-6 as neufral — allhough in principle nofhing 
prevenls exfension of our approaches lo more fhan 
Iwo senfimenl classes. 

3 Discourse depth reweighting 

Our first approach to incorporating discourse in¬ 
formation into sentiment analysis is based on 
quantifying the importance of each unit of text in 
terms of its discourse depth. To do this, we em¬ 
ploy the dependency-based discourse tree (DEP- 
DT) formulation from prior work on summariza¬ 
tion (Hirao et al., 2013). The DEP-DT formal¬ 
ism converts the constituent-like RST tree into 
a directed graph over elementary discourse units 
(EDUs), in a process that is a close analogue of the 
transformation of a headed syntactic constituent 
parse to a syntactic dependency graph (Kiibler et 
al, 2009). 

The DEP-DT representation of the discourse in 
Eigure 1 in shown in Eigure 2. The graph is con¬ 
structed by propagating “head” information up the 
RST tree; if the elementary discourse unit e* is the 
satellite in a discourse relation headed by ej, then 


there is an edge from ej to e*. Thus, the “depth” 
of each EDU is the number of times in which it is 
embedded in the satellite of a discourse relation. 
The exact algorithm for constructing DEP-DTs is 
given by Hirao et al. (2013). 

Given this representation, we construct a simple 
linear function for weighting the contribution of 
the EDU at depth dp. 

Aj = max(0.5,1 — dj/G). (1) 

Thus, at di = 0, we have A* = 1, and at d* > 3, we 
have Aj = 0.5. Now assume each elementary dis¬ 
course unit contributes a prediction 'ipi = O^Wi, 
where Wi is the bag-of-words vector, and d is a 
vector of weights, which may be either learned or 
specified by a sentiment lexicon. Then the overall 
prediction for a document is given by, 

4/ = ^Ai(0Tm,) = 0^(J^Aimi). (2) 

i i 

Evaluation We apply this approach in combi¬ 
nation with both lexicon-based and classification- 
based sentiment analysis. We use the lexicon of 
Wilson et al. (2005), and set 9j = 1 for words 
marked “positive”, and 6j = —1 for words marked 
negative. Eor classification-based analysis, we set 
6 equal to the weights obtained by training a logis¬ 
tic regression classifier, tuning the regularization 
coefficient on held-out data. 

Results are shown in Table 1. As seen in 
the comparison between lines B1 and Dl, dis¬ 
course depth weighting offers substantial improve¬ 
ments over the bag-of-words approach for lexicon- 
based sentiment analysis, with raw improvements 
of 4—5%. Given the simplicity of this approach — 
which requires only a sentiment lexicon and a dis¬ 
course parser — we strongly recommend the ap¬ 
plication of discourse depth weighting for lexicon- 
based sentiment analysis at the document level. 
However, the improvements for the classification- 
based models are considerably smaller, less than 
1% in both datasets. 

4 Rhetorical Recursive Neural Networks 

Discourse-depth reweighting offers significant im¬ 
provements for lexicon-based sentiment analy¬ 
sis, but the improvements over the more accurate 
classification-based method are meager. We there¬ 
fore turn to a data-driven approach for combining 
sentiment analysis with rhetorical structure theory, 
based on recursive neural networks (Socher et al.. 




Pang & Eee 

Soeher et al. 

Baselines 



Bl. Eexicon 

68.3 

74.9 

B2. Classifier 

82.4 

81.5 

Discourse depth weighting 


Dl. Eexicon 

72.6 

78.9 

D2. Classifier 

82.9 

82.0 

Rhetorical recursive neural network 

Rl. No relations 

83.4 

85.5 

R2. With relations 

84.1 

85.6 


Table 1: Sentiment elassifieation aeeuraeies on 
two movie review datasets (Pang and Lee, 2004; 
Soeher et al., 2013), deseribed in Seetion 2.3. 


2011). The idea is simple: reeursively propagate 
sentiment seores up the RST tree, until the root of 
the doeument is reaehed. For nueleus-satellite dis- 
eourse relations, we have: 

4/, = (3) 

where i indexes a diseourse unit eomposed from 
relation r*, n(i) indieates its nueleus, and s{i) in- 
dieates its satellite. Returning to the example in 
Figure 1, the sentiment seore for the diseourse unit 
obtained by eombining IB and IC is obtained 
from 

Similarly, for multinuelear relations, we have, 

% = tanh( ^ (4) 

jen(i) 

In the base ease, eaeh elementary diseourse unit’s 
sentiment is eonstrueted from its bag-of-words, 
4'j = O^Wi. Beeause the strueture of eaeh doe¬ 
ument is different, the network arehiteeture varies 
in eaeh example; nonetheless, the parameters ean 
be reused aeross all instanees. 

This approaeh, whieh we eall a Rhetorieal Re- 
eursive Neural Network (R2N2), is reminiseent of 
the eompositional model proposed by Soeher et al. 
(2013), where eomposition is over the eonstituents 
of the syntaetie parse of a sentenee, rather than 
the units of a diseourse. However, a erueial differ- 
enee is that in R2N2s, the elements 4' and K are 
scalars: we do not attempt to learn a latent dis¬ 
tributed representation of the sub-doeument units. 
This is beeause diseourse units typieally eomprise 
multiple words, so that aeeurate analysis of the 
sentiment for elementary discourse units is not so 
difficult as accurate analysis of individual words. 


The scores for individual discourse units can be 
computed from a bag-of-words classifier, or, in fu- 
fure work, from a more complex model such as a 
recursive or recurrenf neural network. 

While this neural network structure captures 
the idea of compositionality over the RST tree, 
the most deeply embedded discourse units can be 
heavily down-weighted by the recursive compo¬ 
sition (assuming Kg < Kn): in the most ex¬ 
treme case of a right-branching or left-branching 
structure, the recursive operator may be applied N 
times to the most deeply embedded EDU. In con¬ 
trast, discourse depth reweighting applies a uni¬ 
form weight of 0.5 to all discourse units with depth 
> 3. In the spirit of this approach, we add an addi¬ 
tional component to the network architecture, cap¬ 
turing the bag-of-words for the entire document. 
Thus, at the root node we have: 

4tdoc — ^ T ^^rst-root) (5) 

i 

with 4'rst-root defined recursively from Equations 3 
and 4, 0 indicating the vector of per-word weights, 
and the scalar 7 controlling the tradeoff between 
these two components. 

Learning R2N2 is trained by backpropagating 
from a hinge loss objective; assuming yt G 
{— 1 , 1 } for each document t, we have the loss 
Lt = (1 - 2/t4'doc,t)-r- From this loss, we 
use backpropagation through structure to obtain 
gradients on the parameters (Goller and Kuch- 
ler, 1996). Training is performed using stochas¬ 
tic gradient descent. Eor simplicity, we fol¬ 
low Zirn et al. (2011) and focus on the dis¬ 
tinction between contrastive and non-contrastive 
relations. The set of contrastive relations in¬ 
cludes Contrast, Comparison, Antithe¬ 
sis, Antithesis-e, Consequence-s, Con¬ 
cession, and Problem-Solution. 

Evaluation Results for this approach are shown 
in lines R1 and R2 of Table 1. Even without dis¬ 
tinguishing between discourse relations, we get an 
improvement of more than 3% accuracy on the 
Stanford data, and 0.5% on the smaller Pang & 
Eee dataset. Adding sensitivity to discourse rela¬ 
tions (distinguishing for contrastive and non- 
contrastive relations) offers further improvements 
on the Pang & Eee data, outperforming the base¬ 
line classifier (D2) by 1.3%. 

The accuracy of discourse relation detection is 
only 60% for even the best systems (Ji and Eisen- 



stein, 2014), which may help to explain why re¬ 
lations do not offer a more substantial boost. An 
anonymous reviewer recommended evaluating on 
gold RST parse trees to determine the extent to 
which improvements in RST parsing might trans¬ 
fer to downstream document analysis. Such an 
evaluation would seem to require a large corpus of 
texts with both gold RST parse trees and sentiment 
polarity labels; the SFU Review Corpus (Taboada, 
2008) of 30 review texts offers a starting point, but 
is probably too small to train a competitive senti¬ 
ment analysis system. 

5 Related Work 

Section 2 mentions some especially relevant prior 
work. Other efforts to incorporate RST into 
sentiment analysis have often focused on intra- 
sentential discourse relations (Heerschop et ah, 
2011; Zhou et ah, 2011; Chenlo et ah, 2014), 
rather than relations over the entire document. 
Wang et al. (2012) address sentiment analysis in 
Chinese. Lacking a discourse parser, they focus 
on explicit connectives, using a strategy that is re¬ 
lated to our discourse depth reweighting. Wang 
and Wu (2013) use manually-annotated discourse 
parses in combination with a sentiment lexicon, 
which is automatically updated based on the dis¬ 
course structure. Zirn et al. (2011) use an RST 
parser in a Markov Logic Network, with the goal 
of making polarity predictions at the sub-sentence 
level, rather than improving document-level pre¬ 
diction. None of the prior work considers the sort 
of recurrent compositional model presented here. 

An alternative to RST is to incorporate “shal¬ 
low” discourse structure, such as the relations 
from the Penn Discourse Treebank (PDTB). 
PDTB relations were shown to improve sentence- 
level sentiment analysis by Somasundaran et al. 
(2009), and were incorporated in a model of sen¬ 
timent flow by Wachsmuth et al. (2014). PDTB 
relations are often signaled with explicit discourse 
connectives, and these may be used as a fea¬ 
ture (Trivedi and Eisenstein, 2013; Lazaridou et 
ah, 2013) or as posterior constraints (Yang and 
Cardie, 2014). This prior work on discourse rela¬ 
tions within sentences and between adjacent sen¬ 
tences can be viewed as complementary to our fo¬ 
cus on higher-level discourse relations across the 
entire document. 

There are unfortunately few possibilities for 
direct comparison of our approach against prior 


work. Heerschop et al. (2011) and Wachsmuth et 
al. (2014) also employ the Pang and Lee (2004) 
dataset, but neither of their results are directly 
comparable: Heerschop et al. (2011) exclude doc¬ 
uments that SPADE fails to parse, and Wachsmuth 
et al. (2014) evaluates only on individual sentences 
rather than entire documents. The only possi¬ 
ble direct comparison is with very recent work 
from Hogenboom et al. (2015), who employ a 
weighting scheme that is similar to the approach 
described in Section 3. They evaluate on the Pang 
and Eee data, and consider only lexicon-based 
sentiment analysis, obtaining document-level ac¬ 
curacies between 65% (for the baseline) and 72% 
(for their best discourse-augmented system). Ta¬ 
ble 1 shows that fully supervised methods give 
much stronger performance on this dataset, with 
accuracies more than 10% higher. 

6 Conclusion 

Sentiment polarity analysis has typically relied 
on a “preponderance of evidence” strategy, hop¬ 
ing that the words or sentences representing the 
overall polarity will outweigh those representing 
counterpoints or rhetorical concessions. How¬ 
ever, with the availability of off-the-shelf RST dis¬ 
course parsers, it is now easy to include document- 
level structure in sentiment analysis. We show 
that a simple reweighting approach offers robust 
advantages in lexicon-based sentiment analysis, 
and that a recursive neural network can substan¬ 
tially outperform a bag-of-words classifier. Euture 
work will focus on combining models of discourse 
structure with richer models at the sentence level. 

Acknowledgments Thanks to the anonymous review¬ 
ers for their helpful suggestions on how to improve the paper. 
The following members of the Georgia Tech Computational 
Linguistics Laboratory offered feedback throughout the re¬ 
search process: Naman Goyal, Vinodh Krishan, Umashanthi 
Pavalanathan, Ana Smith, Yijie Wang, and Yi Yang. Several 
class projects in Georgia Tech CS 4650/7650 took alternative 
approaches to the application of discourse parsing to senti¬ 
ment analysis, which was informative to this work; thanks 
particularly to Julia Cochran, Rohit Pathak, Pavan Kumar 
Ramnath, and Bharadwaj Tanikella. This research was sup¬ 
ported by a Google Faculty Research Award, by the National 
Institutes of Health under award number ROIGMI12697-01, 
and by the Air Force Office of Scientific Research. The con¬ 
tent is solely the responsibility of the authors and does not 
necessarily represent the official views of these sponsors. 



References 

Jose M Chenlo, Alexander Hogenboom, and David E 
Losada. 2014. Rhetorical structure theory for po¬ 
larity estimation; An experimental study. Data & 
Knowledge Engineering. 

Cristian Danescu-Niculescu-Mizil, Moritz Sudhof, 
Dan Jurafsky, Jure Leskovec, and Christopher Potts. 
2013. A computational approach to politeness with 
application to social factors. In Proceedings of the 
Association for Computational Linguistics (ACL), 
pages 250-259, Sophia, Bulgaria. 

Ronen Feldman. 2013. Techniques and applica¬ 
tions for sentiment analysis. Communications of the 
ACM, 56(4):82-89. 

Vanessa Wei Feng and Graeme Hirst. 2012. Text-level 
Discourse Parsing with Rich Finguistic Features. In 
Proceedings of the Association for Computational 
Linguistics (ACL), Jeju, Korea. 

Shima Gerani, Yashar Mehdad, Giuseppe Carenini, 
Raymond T Ng, and Bita Nejat. 2014. Abstractive 
summarization of product reviews using discourse 
structure. In Proceedings of the Association for 
Computational Linguistics (ACL), Baltimore, MD. 

Christoph Goller and Andreas Kuchler. 1996. Team¬ 
ing task-dependent distributed representations by 
backpropagation through structure. In Neural Net¬ 
works, IEEE International Conference on, pages 
347-352. IEEE. 

Vasileios Hatzivassiloglou and Kathleen R. McKeown. 
1997. Predicting the semantic orientation of adjec¬ 
tives. In Proceedings of the Association for Compu¬ 
tational Linguistics (ACL), pages 174—181, Madrid, 
Spain. 

Bas Heerschop, Frank Goossen, Alexander Hogen¬ 
boom, Flavius Frasincar, Uzay Kaymak, and Fran- 
ciska de Jong. 2011. Polarity analysis of texts using 
discourse structure. In Proceedings of the 20th ACM 
international conference on Information and knowl¬ 
edge management, pages 1061-1070. ACM. 

Hugo Hernault, Helmut Prendinger, David A. duVerle, 
and Mitsuru Ishizuka. 2010. HIFDA: A Discourse 
Parser Using Support Vector Machine Classification. 
Dialogue and Discourse, 1(3): 1-33. 

Tsutomu Hirao, Yasuhisa Yoshida, Masaaki Nishino, 
Norihito Yasuda, and Masaaki Nagata. 2013. 
Single-document summarization as a tree knapsack 
problem. In Proceedings of Empirical Methods 
for Natural Language Processing (EMNLP), pages 
1515-1520. 

Alexander Hogenboom, Flavius Frasincar, Franciska 
de Jong, and Uzay Kaymak. 2015. Using rhetorical 
stmcture in sentiment analysis. Communications of 
the ACM, 58(7):69-77. 


Yangfeng Ji and Jacob Eisenstein. 2014. Represen¬ 
tation learning for text-level discourse parsing. In 
Proceedings of the Association for Computational 
Linguistics (ACL), Baltimore, MD. 

Shahq Joty, Giuseppe Carenini, and Raymond Ng. 
2015. CODRA: A novel discriminative framework 
for rhetorical analysis. Computational Linguistics, 
41(3). 

Dan Jurafsky, Victor Chahuneau, Bryan R Routledge, 
and Noah A Smith. 2014. Narrative framing of con¬ 
sumer sentiment in online restaurant reviews. Pirst 
Monday, 19(4). 

Suin Kim, JinYeong Bak, and Alice Haeyun Oh. 2012. 
Do you feel what i feel? social aspects of emotions 
in twitter conversations. In Proceedings of the In¬ 
ternational Conference on Web and Social Media 
(ICWSM), Menlo Park, California. AAAI Publica¬ 
tions. 

Sandra Kiibler, Ryan McDonald, and Joakim Nivre. 
2009. Dependency parsing. Synthesis Lectures on 
Human Language Technologies, 1(1):1-127. 

Angeliki Fazaridou, Ivan Titov, and Caroline 
Sporleder. 2013. A bayesian model for joint 

unsupervised induction of sentiment, aspect and 
discourse representations. In Proceedings of the 
Association for Computational Linguistics (ACL), 
pages 1630-1639, Sophia, Bulgaria. 

Jiwei Fi, Rumeng Fi, and Eduard Hovy. 2014. Recur¬ 
sive deep models for discourse parsing. In Proceed¬ 
ings of Empirical Methods for Natural Language 
Processing (EMNLP). 

Bing Fiu. 2012. Sentiment analysis and opinion min¬ 
ing. Synthesis Lectures on Human Language Tech¬ 
nologies, 5(1): 1-167. 

William Mann. 1984. Discourse structures for text 
generation. In Proceedings of the 10th International 
Conference on Computational Linguistics and 22nd 
annual meeting on Association for Computational 
Linguistics, pages 367-375. Association for Com¬ 
putational Finguistics. 

Daniel Marcu. 1997. The rhetorical parsing of natu¬ 
ral language texts. In Proceedings of the European 
Chapter of the Association for Computational Lin¬ 
guistics (EACL), pages 96-103. 

Daniel Marcu. 1999. The automatic construction of 
large-scale corpora for summarization research. In 
Proceedings of the 22nd annual international ACM 
SIGIR conference on Research and development in 
information retrieval, pages 137-144. ACM. 

Bo Pang and Fillian Fee. 2004. A sentimental edu¬ 
cation; Sentiment analysis using subjectivity sum¬ 
marization based on minimum cuts. In Proceed¬ 
ings of the Association for Computational Linguis¬ 
tics (ACL), pages 271-278. 



Bo Pang and Lillian Lee. 2008. Opinion mining and 
sentiment analysis. Foundations and trends in infor¬ 
mation retrieval, 2(1-2): 1-135. 

Kenji Sagae. 2009. Analysis of Discourse Struc¬ 
ture with Syntactic Dependencies and Data-Driven 
Shift-Reduce Parsing. In Proceedings of the 11th 
International Conference on Parsing Technologies 
(1WPT’09), pages 81-84, Paris, France, October. 
Association for Computational Linguistics. 

Paul Smolensky. 1990. Tensor product variable bind¬ 
ing and the representation of symbolic structures 
in connectionist systems. Artificial intelligence, 
46(1):159-216. 

Richard Socher, Cliff C. Lin, Andrew Y. Ng, and 
Christopher D. Manning. 2011. Parsing Natural 
Scenes and Natural Language with Recursive Neu¬ 
ral Networks. In Proceedings of the International 
Conference on Machine Learning (ICML), Seattle, 
WA. 

Richard Socher, Alex Perelygin, Jean Y Wu, Jason 
Chuang, Christopher D Manning, Andrew Y Ng, 
and Christopher Potts. 2013. Recursive deep mod¬ 
els for semantic compositionality over a sentiment 
treebank. In Proceedings of Empirical Methods for 
Natural Language Processing (EMNLP). 

Swapna Somasundaran, Galileo Namata, Janyce 
Wiebe, and Lise Getoor. 2009. Supervised and 
unsupervised methods in employing discourse rela¬ 
tions for improving opinion polarity classification. 
In Proceedings of Empirical Methods for Natural 
Language Processing (EMNLP), Singapore. 

Radu Soricut and Daniel Marcu. 2003. Sentence 
level discourse parsing using syntactic and lexical 
information. In Proceedings of the North American 
Chapter of the Association for Computational Lin¬ 
guistics (NAACL), pages 149-156. 

Philip J. Stone. 1966. The General Inquirer: A 
Computer Approach to Content Analysis. The MIT 
Press. 

Maite Taboada, Julian Brooke, Milan Tofiloski, Kim¬ 
berly Voll, and Manfred Stede. 2011. Lexicon- 
based methods for sentiment analysis. Computa¬ 
tional linguistics, 37(2):267-307. 

Maite Taboada. 2008. SFU review cor¬ 
pus. http: //www .sfu.ca/~mtaboada/ 
research/SFU_Review_Corpus.html. 

Yla R Tausczik and James W Pennebaker. 2010. The 
psychological meaning of words: LIWC and com¬ 
puterized text analysis methods. Journal of Lan¬ 
guage and Social Psychology, 29(l):24-54. 

Rakshit Trivedi and Jacob Eisenstein. 2013. Discourse 
connectors for latent subjectivity in sentiment anal¬ 
ysis. In Proceedings of the North American Chap¬ 
ter of the Association for Computational Linguistics 
(NAACL), pages 808-813, Stroudsburg, Pennsylva¬ 
nia. Association for Computational Linguistics. 


Vinicius Rodrigues Uzeda, Thiago Alexan¬ 
dre Salgueiro Pardo, and Maria Das Gragas Volpe 
Nunes. 2010. A comprehensive comparative eval¬ 
uation of rst-based summarization methods. ACM 
Transactions on Speech and Language Processing 
(TSLP), 6(4):4. 

Kimberly Voll and Maite Taboada. 2007. Not all 
words are created equal: Extracting semantic orien¬ 
tation as a function of adjective relevance. In Pro¬ 
ceedings of Australian Conference on Artificial In¬ 
telligence. 

Henning Wachsmuth, Martin Trenkmann, Benno Stein, 
and Gregor Engels. 2014. Modeling review argu¬ 
mentation for robust sentiment analysis. In Proceed¬ 
ings of the International Conference on Computa¬ 
tional Linguistics (COLING). 

Eei Wang and Yunfang Wu. 2013. Exploiting hierar¬ 
chical discourse structure for review sentiment anal¬ 
ysis. In Proceedings of the Conference on Asian 
Language Processing (lALP), pages 121-124. 

Eei Wang, Yunfang Wu, and Likun Qiu. 2012. Ex¬ 
ploiting discourse relations for sentiment analysis. 
In Proceedings of the International Conference on 
Computational Linguistics (COLING), pages 1311- 
1320, Mumbai, India. 

Theresa Wilson, Janyce Wiebe, and Paul Hoffmann. 
2005. Recognizing contextual polarity in phrase- 
level sentiment analysis. In Proceedings of Em¬ 
pirical Methods for Natural Language Processing 
(EMNLP), pages 347-354. 

Bishan Yang and Claire Cardie. 2014. Context-aware 
learning for sentence-level sentiment analysis with 
posterior regularization. In Proceedings of the As- 
sociation for Computational Linguistics (ACL), Bal¬ 
timore, MD. 

Yasuhisa Yoshida, Jun Suzuki, Tsutomu Hirao, and 
Masaaki Nagata. 2014. Dependency-based Dis¬ 
course Parser for Single-Document Summarization. 
In Proceedings of Empirical Methods for Natural 
Language Processing (EMNLP). 

Lanjun Zhou, Binyang Li, Wei Gao, Zhongyu Wei, 
and Kam-Eai Wong. 2011. Unsupervised dis¬ 
covery of discourse relations for eliminating intra¬ 
sentence polarity ambiguities. In Proceedings of 
Empirical Methods for Natural Language Process¬ 
ing (EMNLP), pages 162-171. 

Cacilia Zirn, Mathias Niepert, Heiner Stuckenschmidt, 
and Michael Strube. 2011. Pine-grained sentiment 
analysis with structural features. In Proceedings of 
the International Joint Conference on Natural Lan¬ 
guage Processing (IJCNLP), pages 336-344, Chi- 
ang Mai, Thailand. 



